8.1 Normal Distribution: Introduction TCC


Student Learning Outcomes 
By the end of this chapter, the student should be able to: 


• Recognize the normal probability distribution and apply it
appropriately.

• Recognize the standard normal probability distribution and apply it
appropriately.

• Compare normal probabilities by converting to the standard normal
distribution.


Introduction 


The normal, a continuous distribution, is the most important of all the 
distributions. It is widely used and even more widely abused. Its graph is 
bell-shaped. You see the bell curve in almost all disciplines. Some of these 
include psychology, business, economics, the sciences, nursing, and, of 
course, mathematics. Some of your instructors may use the normal 
distribution to help determine your grade. Most IQ scores are normally 
distributed. Often real estate prices fit a normal distribution. The normal 
distribution is extremely important but it cannot be applied to everything in 
the real world. 


In this chapter, you will study the normal distribution, the standard normal, 
and applications associated with them. 


Optional Collaborative Classroom Activity 


Your instructor will record the heights of both men and women in your 
class, separately. Draw histograms of your data. Then draw a smooth curve 
through each histogram. Is each curve somewhat bell-shaped? Do you think 
that if you had recorded 200 data values for men and 200 for women that 
the curves would look bell-shaped? Calculate the mean for each data set. 
Write the means on the x-axis of the appropriate graph below the peak. 


Shade the approximate area that represents the probability that one 
randomly chosen male is taller than 72 inches. Shade the approximate area 
that represents the probability that one randomly chosen female is shorter 
than 60 inches. If the total area under each curve is one, does either 
probability appear to be more than 0.5? 


The normal distribution has two parameters (two numerical descriptive
measures), the mean (μ) and the standard deviation (σ). If X is a quantity to
be measured that has a normal distribution with mean (μ) and standard
deviation (σ), we designate this by writing


NORMAL: X ~ N(μ, σ)


The probability density function is a rather complicated function. Do not 
memorize it. It is not necessary. 


$f(x) = \frac{1}{\sigma\sqrt{2\pi}} \, e^{-\frac{(x-\mu)^{2}}{2\sigma^{2}}}$


The cumulative distribution function is P(X < x). It is calculated either
by a calculator or a computer, or it is looked up in a table. See the NOTE in
this chapter in Calculation of Probabilities.
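As a rough illustration of what "calculated by a computer" can look like, the sketch below writes the density out by hand and compares it with Python's standard-library `statistics.NormalDist`. The function name `normal_pdf` and the parameters (mean 5, standard deviation 2) are assumptions chosen for illustration; they are not part of the text.

```python
import math
from statistics import NormalDist


def normal_pdf(x, mu, sigma):
    """Height of the N(mu, sigma) density at x, written out from the formula."""
    coeff = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return coeff * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))


# Illustrative parameters (an assumption, not from the text).
mu, sigma = 5.0, 2.0
dist = NormalDist(mu, sigma)

height_at_mean = normal_pdf(mu, mu, sigma)   # hand-written formula
library_height = dist.pdf(mu)                # same height from the library
area_left_of_mean = dist.cdf(mu)             # P(X < mu), the cumulative area

print(height_at_mean, library_height, area_left_of_mean)
```

The two heights should agree, and the area to the left of the mean should be one half, which is the symmetry property discussed next.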


The curve is symmetrical about a vertical line drawn through the mean, μ.
In theory, the mean is the same as the median since the graph is symmetric
about μ. As the notation indicates, the normal distribution depends only on
the mean and the standard deviation. Since the area under the curve must
equal one, a change in the standard deviation, σ, causes a change in the
shape of the curve; the curve becomes fatter or skinnier depending on σ. A
change in μ causes the graph to shift to the left or right. This means there
are an infinite number of normal probability distributions. One of special
interest is called the standard normal distribution.


Glossary 


Normal Distribution 
A continuous random variable (RV) with pdf
$f(x) = \frac{1}{\sigma\sqrt{2\pi}} \, e^{-\frac{(x-\mu)^{2}}{2\sigma^{2}}}$, where μ is the mean of the distribution and
σ is the standard deviation. Notation: X ~ N(μ, σ). If μ = 0 and
σ = 1, the RV is called the standard normal distribution.


8.2 Normal Distribution: Standard Normal Distribution TCC 


The standard normal distribution is a normal distribution of 
standardized values called z-scores. A z-score is measured in units of 
the standard deviation. For example, if the mean of a normal distribution 
is 5 and the standard deviation is 2, the value 11 is 3 standard deviations 
above (or to the right of) the mean. The calculation is: 

Equation:

$z = \frac{x - \mu}{\sigma} = \frac{11 - 5}{2} = 3$

The z-score is 3. 


The mean for the standard normal distribution is 0 and the standard
deviation is 1. The transformation $z = \frac{x - \mu}{\sigma}$ produces the
distribution Z ~ N(0, 1). The value x comes from a normal distribution with
mean μ and standard deviation σ.


Glossary 


Standard Normal Distribution 
A continuous random variable (RV) X ~ N(0, 1). When X follows the
standard normal distribution, it is often noted as Z ~ N(0, 1).


z-score 
The linear transformation of the form $z = \frac{x - \mu}{\sigma}$. If this transformation
is applied to any normal distribution X ~ N(μ, σ), the result is the
standard normal distribution Z ~ N(0, 1). If this transformation is
applied to any specific value x of the RV with mean μ and standard
deviation σ, the result is called the z-score of x. Z-scores allow us to
compare data that are normally distributed but scaled differently.


8.3 Normal Distribution: Z-scores TCC 


If X is a normally distributed random variable and X ~ N(μ, σ), then the
z-score is:
Equation:

$z = \frac{x - \mu}{\sigma}$

The z-score tells you how many standard deviations the value x is
above (to the right of) or below (to the left of) the mean, μ. Values of x
that are larger than the mean have positive z-scores and values of x that are
smaller than the mean have negative z-scores. If x equals the mean, then x
has a z-score of 0.


Example: 

Suppose X ~ N(5, 6). This says that X is a normally distributed random
variable with mean μ = 5 and standard deviation σ = 6. Suppose x = 17.
Then:

Equation:

$z = \frac{x - \mu}{\sigma} = \frac{17 - 5}{6} = 2$

This means that x = 17 is 2 standard deviations (2σ) above or to the
right of the mean μ = 5. The standard deviation is σ = 6.

Notice that:

Equation:

$5 + 2 \cdot 6 = 17$ (The pattern is μ + zσ = x.)


Now suppose x = 1. Then:
Equation:

$z = \frac{x - \mu}{\sigma} = \frac{1 - 5}{6} = -0.67$ (rounded to two decimal places)

This means that x = 1 is 0.67 standard deviations (−0.67σ) below or to
the left of the mean μ = 5. Notice that:

5 + (−0.67)(6) is approximately equal to 1 (This has the pattern
μ + (−0.67)σ = 1)

Summarizing, when z is positive, x is above or to the right of μ and when
z is negative, x is to the left of or below μ.
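The two calculations in this example can be checked with a few lines of code. This is only a sketch: the helper names `z_score` and `value_from_z` are made up for illustration, and the numbers are the ones from the example, X ~ N(5, 6).

```python
def z_score(x, mu, sigma):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mu) / sigma


def value_from_z(z, mu, sigma):
    """Undo the transformation: recover x from its z-score (x = mu + z * sigma)."""
    return mu + z * sigma


mu, sigma = 5, 6                    # X ~ N(5, 6) from the example

z_17 = z_score(17, mu, sigma)       # x = 17 lies above the mean
z_1 = z_score(1, mu, sigma)         # x = 1 lies below the mean

print(z_17, round(z_1, 2))
print(value_from_z(z_17, mu, sigma), value_from_z(z_1, mu, sigma))
```

A positive result places x to the right of the mean and a negative result places it to the left, matching the summary above.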


Example: 

Some doctors believe that a person can lose 5 pounds, on the average, in a 
month by reducing his/her fat intake and by exercising consistently. 
Suppose weight loss has a normal distribution. Let X = the amount of 
weight lost (in pounds) by a person in a month. Use a standard deviation of 
2 pounds. X~N(5, 2). Fill in the blanks. 

Exercise: 


Problem: 


Suppose a person lost 10 pounds in a month. The z-score when
x = 10 pounds is z = 2.5 (verify). This z-score tells you that x = 10
is ________ standard deviations to the ________ (right or left) of the
mean ________ (What is the mean?).
Solution: 


This z-score tells you that x = 10 is 2.5 standard deviations to the 
right of the mean 5. 


Exercise: 
Problem: 
Suppose a person gained 3 pounds (a negative weight loss). Then z =
________. This z-score tells you that x = −3 is ________ standard
deviations to the ________ (right or left) of the mean.


Solution: 


z = −4. This z-score tells you that x = −3 is 4 standard deviations to
the left of the mean.

Suppose the random variables X and Y have the following normal
distributions: X ~ N(5, 6) and Y ~ N(2, 1). If x = 17, then z = 2. (This
was previously shown.) If y = 4, what is z?
Equation:

$z = \frac{y - \mu}{\sigma} = \frac{4 - 2}{1} = 2$, where μ = 2 and σ = 1.


The z-score for y = 4 is z = 2. This means that 4 is z = 2 standard 
deviations to the right of the mean. Therefore, x = 17 and y = 4 are both 2 
(of their) standard deviations to the right of their respective means. 

The z-score allows us to compare data that are scaled differently. To 
understand the concept, suppose X ~N(5, 6) represents weight gains for 
one group of people who are trying to gain weight in a 6 week period and 
Y ~N(2, 1) measures the same weight gain for a second group of people. 
A negative weight gain would be a weight loss. Since x = 17 and y = 4 
are each 2 standard deviations to the right of their means, they represent 
the same weight gain relative to their means. 


The Empirical Rule 
If X is a random variable and has a normal distribution with mean μ and
standard deviation σ, then the Empirical Rule says (see the figure below):


• About 68.27% of the x values lie between −1σ and +1σ of the mean μ
(within 1 standard deviation of the mean).

• About 95.45% of the x values lie between −2σ and +2σ of the mean μ
(within 2 standard deviations of the mean).

• About 99.73% of the x values lie between −3σ and +3σ of the mean μ
(within 3 standard deviations of the mean). Notice that almost all the x
values lie within 3 standard deviations of the mean.

• The z-scores for +1σ and −1σ are +1 and −1, respectively.

• The z-scores for +2σ and −2σ are +2 and −2, respectively.

• The z-scores for +3σ and −3σ are +3 and −3, respectively.


[Figure: normal curve with the x-axis marked at μ − 3σ, μ − 2σ, μ − 1σ, μ, μ + 1σ, μ + 2σ, μ + 3σ]


The Empirical Rule is also known as the 68-95-99.7 Rule. 


Example: 


Suppose X has a normal distribution with mean 50 and standard deviation 
6. 


• About 68.27% of the x values lie between −1σ = (−1)(6) = −6 and 1σ =
(1)(6) = 6 of the mean 50. The values 50 − 6 = 44 and 50 + 6 = 56 are
within 1 standard deviation of the mean 50. The z-scores are −1 and
+1 for 44 and 56, respectively.

• About 95.45% of the x values lie between −2σ = (−2)(6) = −12 and 2σ
= (2)(6) = 12 of the mean 50. The values 50 − 12 = 38 and 50 + 12 =
62 are within 2 standard deviations of the mean 50. The z-scores are
−2 and +2 for 38 and 62, respectively.

• About 99.73% of the x values lie between −3σ = (−3)(6) = −18 and 3σ
= (3)(6) = 18 of the mean 50. The values 50 − 18 = 32 and 50 + 18 =
68 are within 3 standard deviations of the mean 50. The z-scores are
−3 and +3 for 32 and 68, respectively.
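The percentages in the Empirical Rule can be reproduced from the standard normal distribution. The sketch below uses Python's `statistics.NormalDist`; the helper name `within` is an assumption made for illustration, and the mean 50 and standard deviation 6 are taken from the example above.

```python
from statistics import NormalDist

Z = NormalDist(0, 1)  # standard normal distribution


def within(k):
    """Area under the standard normal curve between -k and +k."""
    return Z.cdf(k) - Z.cdf(-k)


# Share of values within 1, 2 and 3 standard deviations of the mean.
coverage = {k: within(k) for k in (1, 2, 3)}

# The matching x intervals for X ~ N(50, 6).
mu, sigma = 50, 6
intervals = {k: (mu - k * sigma, mu + k * sigma) for k in (1, 2, 3)}

for k in (1, 2, 3):
    print(k, intervals[k], round(coverage[k], 4))
```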


8.4 Calculations of Probabilities TCC 


Probabilities are calculated by using technology. There are instructions in 
the chapter for the TI-83+ and TI-84 calculators. 


Note: In the Table of Contents for Collaborative Statistics, entry 15.
Tables has a link to a table of normal probabilities. Use the probability
tables, if so desired, instead of a calculator. The tables include instructions
for how to use them.


Example: 
If the area to the left is 0.0228, then the area to the right is 
1 − 0.0228 = 0.9772.


Example: 

The final exam scores in a statistics class were normally distributed with a 
mean of 63 and a standard deviation of 5. 

Exercise: 


Problem: 


Find the probability that a randomly selected student scored more than 
65 on the exam. 


Solution: 


Let X = a score on the final exam. X ~ N(63, 5), where μ = 63 and
σ = 5.


Draw a graph. 


Then, find P(x > 65).


P(x > 65) = 0.3446 (calculator or computer)


[Figure: normal curve centered at 63 with the area to the right of 65 shaded; the shaded area is 0.3446]


The probability that one student scores more than 65 is 0.3446.


Using the TI-83+ or the TI-84 calculators, the calculation is as
follows. Go into 2nd DISTR.


After pressing 2nd DISTR, press 2:normalcdf.
The syntax for the instruction is shown below.


normalcdf(lower value, upper value, mean, standard deviation) For
this problem: normalcdf(65,1E99,63,5) = 0.3446. You get 1E99 ( =
10^99) by pressing 1, the EE key (a 2nd key) and then 99. Or, you can
enter 10^99 instead. The number 10^99 is way out in the right tail of
the normal curve. We are calculating the area between 65 and 10^99. In
some instances, the lower number of the area might be -1E99 ( =
−10^99). The number −10^99 is way out in the left tail of the normal
curve.


Note: The TI probability program calculates a z-score and then the
probability from the z-score. Before technology, the z-score was
looked up in a standard normal probability table (because the math
involved is too cumbersome) to find the probability. In this example,
a standard normal table with area to the left of the z-score was used.
You calculate the z-score and look up the area to the left. The
probability is the area to the right.


$z = \frac{65 - 63}{5} = 0.4$. Area to the left is 0.6554.
P(x > 65) = P(z > 0.4) = 1 − 0.6554 = 0.3446
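For readers working on a computer rather than a TI calculator, the same probability can be found both ways described above: directly from X ~ N(63, 5), and through the z-score and the standard normal area to the left. This is a sketch using Python's `statistics.NormalDist`; the variable names are illustrative.

```python
from statistics import NormalDist

exam = NormalDist(mu=63, sigma=5)      # X ~ N(63, 5)

# Direct route: area to the right of 65 is one minus the area to the left.
p_direct = 1 - exam.cdf(65)

# Table route: convert to a z-score, then use the standard normal curve.
z = (65 - 63) / 5
area_left = NormalDist().cdf(z)        # what a standard normal table would give
p_from_z = 1 - area_left

print(round(z, 1), round(area_left, 4), round(p_direct, 4), round(p_from_z, 4))
```

Here `1 - exam.cdf(65)` plays the role of normalcdf(65,1E99,63,5).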


Exercise: 


Problem: 


Find the probability that a randomly selected student scored less than 
85. 


Solution: 


Draw a graph. 


Then find P(x < 85). Shade the graph. P(x < 85) = 1 (calculator
or computer)


The probability that one student scores less than 85 is approximately 1 
(or 100%). 


The TI-instructions and answer are as follows: 


normalcdf(0,85,63,5) = 1 (rounds to 1) 
Exercise: 


Problem: 


Find the 90th percentile (that is, find the score k that has 90 % of the 
scores below k and 10% of the scores above k). 


Solution: 


Find the 90th percentile. For each problem or part of a problem, draw 
a new graph. Draw the x-axis. Shade the area that corresponds to the 
90th percentile. 


Let k = the 90th percentile. k is located on the x-axis. P(x < k) is
the area to the left of k. The 90th percentile k separates the exam
scores into those that are the same or lower than k and those that are
the same or higher. Ninety percent of the test scores are the same or
lower than k and 10% are the same or higher. k is often called a
critical value.


k = 69.4 (calculator or computer) 


P(x < k) = 0.90 


The 90th percentile is 69.4. This means that 90% of the test scores fall
at or below 69.4 and 10% fall at or above. For the TI-83+ or TI-84
calculators, use invNorm in 2nd DISTR. invNorm(area to the left,
mean, standard deviation) For this problem, invNorm(0.90,63,5) =
69.4


Exercise: 


Problem: 


Find the 70th percentile (that is, find the score k such that 70% of 
scores are below k and 30% of the scores are above k). 


Solution: 


Find the 70th percentile. 
Draw a new graph and label it appropriately. k = 65.6 


The 70th percentile is 65.6. This means that 70% of the test scores fall
at or below 65.6 and 30% fall at or above.


invNorm(0.70,63,5) = 65.6 
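The two percentile calculations can also be sketched in Python, where the `inv_cdf` method of `statistics.NormalDist` plays the role of invNorm: it takes the area to the left and returns the score k. The variable names are illustrative.

```python
from statistics import NormalDist

exam = NormalDist(mu=63, sigma=5)   # X ~ N(63, 5)

k90 = exam.inv_cdf(0.90)            # score with 90% of the area to its left
k70 = exam.inv_cdf(0.70)            # score with 70% of the area to its left

print(round(k90, 1), round(k70, 1))

# Feeding a percentile back into the cdf should return the original area.
print(round(exam.cdf(k90), 2), round(exam.cdf(k70), 2))
```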


Example: 

A computer is used for office work at home, research, communication, 
personal finances, education, entertainment, social networking and a 
myriad of other things. Suppose that the average number of hours a 
household personal computer is used for entertainment is 2 hours per day. 
Assume the times for entertainment are normally distributed and the 
standard deviation for the times is half an hour. 

Exercise: 


Problem: 


Find the probability that a household personal computer is used 
between 1.8 and 2.75 hours per day. 


Solution: 


Let X = the amount of time (in hours) a household personal computer
is used for entertainment. X ~ N(2, 0.5) where μ = 2 and σ = 0.5.


Find P(1.8 < x < 2.75).


The probability for which you are looking is the area between
x = 1.8 and x = 2.75. P(1.8 < x < 2.75) = 0.5886


[Figure: normal curve centered at 2 with the area between 1.8 and 2.75 shaded]


normalcdf(1.8,2.75,2,0.5) = 0.5886 
The probability that a household personal computer is used between 
1.8 and 2.75 hours per day for entertainment is 0.5886. 
Exercise: 
Problem: 


Find the maximum number of hours per day that the bottom quartile 
of households use a personal computer for entertainment. 


Solution: 


To find the maximum number of hours per day that the bottom 
quartile of households uses a personal computer for entertainment, 
find the 25th percentile, k, where P(x < k) = 0.25. 


k = 1.66


[Figure: normal curve with P(x < k) = 0.25 shaded to the left of k and P(x > k) = 0.75 to the right of k]


invNorm(0.25,2,.5) = 1.66 


The maximum number of hours per day that the bottom quartile of 
households uses a personal computer for entertainment is 1.66 hours. 
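Both parts of this example can be reproduced with a short script. The sketch below again relies on Python's `statistics.NormalDist`: an area between two values is the difference of two left-hand areas, and the bottom quartile cutoff is the 25th percentile. The variable names are illustrative.

```python
from statistics import NormalDist

usage = NormalDist(mu=2, sigma=0.5)   # X ~ N(2, 0.5), hours per day

# Area between 1.8 and 2.75: area left of 2.75 minus area left of 1.8.
p_between = usage.cdf(2.75) - usage.cdf(1.8)

# Bottom quartile cutoff: the value k with P(x < k) = 0.25.
k25 = usage.inv_cdf(0.25)

print(round(p_between, 4), round(k25, 2))
```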


8.5 Normal Distribution: Practice TCC 


Student Learning Outcomes 


• The student will analyze data following a normal distribution.


Given 


The life of Sunshine CD players is normally distributed with a mean of 4.1 
years and a standard deviation of 1.3 years. A CD player is guaranteed for 3 
years. We are interested in the length of time a CD player lasts. 


Normal Distribution 
Exercise: 


Problem: Define the Random Variable X in words. X = ________


Exercise: 


Problem: X ~ ________
Exercise: 


Problem: 


Find the probability that a CD player will break down during the 
guarantee period. 


• a Sketch the situation. Label and scale the axes. Shade the region
corresponding to the probability.


• b P(0 < x < ________) = ________ (Use zero (0) for the
minimum value of x.)


Solution: 
eb 
Exercise: 
Problem: 
Find the probability that a CD player will last between 2.8 and 6 years. 


• a Sketch the situation. Label and scale the axes. Shade the region
corresponding to the probability.


• b P(________ < x < ________) = ________


Solution: 
e b 
Exercise: 
Problem: 


Find the 70th percentile of the distribution for the time a CD player 
lasts. 


• a Sketch the situation. Label and scale the axes. Shade the region
corresponding to the lower 70%.


• b P(x < k) = ________. Therefore, k = ________


Solution: 


eb years 


8.6 Normal Distribution: Homework TCC 
Exercise: 


Problem: 


According to a study done by De Anza students, the height for Asian 
adult males is normally distributed with an average of 66 inches and a 
standard deviation of 2.5 inches. Suppose one Asian adult male is 
randomly chosen. Let X =height of the individual. 


• a X ~ ________ (________, ________)

• b Find the probability that the person is between 65 and 69
inches. Include a sketch of the graph and write a probability
statement.

• c Would you expect to meet many Asian adult males over 72
inches? Explain why or why not, and justify your answer
numerically.

• d The middle 40% of heights fall between what two values?
Sketch the graph and write the probability statement.


Solution:


• a N(66, 2.5)

• b 0.5404

• c No

• d Between 64.7 and 67.3 inches


Exercise: 


Problem: 


IQ is normally distributed with a mean of 100 and a standard deviation 
of 15. Suppose one individual is randomly chosen. Let X =IQ of an 
individual. 


• a X ~ ________ (________, ________)

• b Find the probability that the person has an IQ greater than 120.
Include a sketch of the graph and write a probability statement.

• c Mensa is an organization whose members have the top 2% of all
IQs. Find the minimum IQ needed to qualify for the Mensa
organization. Sketch the graph and write the probability
statement.

• d The middle 50% of IQs fall between what two values? Sketch
the graph and write the probability statement.


Exercise: 


Problem: 


The percent of fat calories that a person in America consumes each 
day is normally distributed with a mean of about 36 and a standard 
deviation of 10. Suppose that one individual is randomly chosen. Let 
X =percent of fat calories. 


• a X ~ ________ (________, ________)

• b Find the probability that the percent of fat calories a person
consumes is more than 40. Graph the situation. Shade in the area
to be determined.

• c Find the maximum number for the lower quarter of percent of
fat calories. Sketch the graph and write the probability statement.


Solution:


• a N(36, 10)

• b 0.3446

• c 29.3


Exercise: 
Problem: 
Suppose that the distance of fly balls hit to the outfield (in baseball) is 


normally distributed with a mean of 250 feet and a standard deviation 
of 50 feet. 


• a If X = distance in feet for a fly ball, then X ~ ________
(________, ________)

• b If one fly ball is randomly chosen from this distribution, what is
the probability that this ball traveled fewer than 220 feet? Sketch
the graph. Scale the horizontal axis X. Shade the region
corresponding to the probability. Find the probability.

• c Find the 80th percentile of the distribution of fly balls. Sketch
the graph and write the probability statement.


Exercise: 


Problem: 


In China, 4-year-olds average 3 hours a day unsupervised. Most of the 
unsupervised children live in rural areas, considered safe. Suppose that 
the standard deviation is 1.5 hours and the amount of time spent alone 
is normally distributed. We randomly survey one Chinese 4-year-old 
living in a rural area. We are interested in the amount of time the child 
spends alone per day. (Source: San Jose Mercury News) 


• a In words, define the random variable X. X = ________

• b X ~ ________

• c Find the probability that the child spends less than 1 hour per
day unsupervised. Sketch the graph and write the probability
statement.

• d What percent of the children spend over 10 hours per day
unsupervised?

• e 70% of the children spend at least how long per day
unsupervised?


Solution:


• a The time (in hours) a 4-year-old in China spends unsupervised
per day

• b N(3, 1.5)

• c 0.0912

• d 0

• e 2.21 hours


Exercise: 


Problem: 


In the 1992 presidential election, Alaska’s 40 election districts 
averaged 1956.8 votes per district for President Clinton. The standard 
deviation was 572.3. (There are only 40 election districts in Alaska.) 
The distribution of the votes per district for President Clinton was bell- 
shaped. Let X = number of votes for President Clinton for an election 
district. (Source: The World Almanac and Book of Facts) 


a State the approximate distribution of X. X~ 

b Is 1956.8 a population mean or a sample mean? How do you 
know? 

c Find the probability that a randomly selected district had fewer 
than 1600 votes for President Clinton. Sketch the graph and write 
the probability statement. 

d Find the probability that a randomly selected district had 
between 1800 and 2000 votes for President Clinton. 

e Find the third quartile for votes for President Clinton. 


Exercise: 


Problem: 


Suppose that the duration of a particular type of criminal trial is known 
to be normally distributed with a mean of 21 days and a standard 
deviation of 7 days. 


a In words, define the random variable X. X = 

b X~ 

c If one of the trials is randomly chosen, find the probability that 
it lasted at least 24 days. Sketch the graph and write the 
probability statement. 

d 60% of all of these types of trials are completed within how 
many days? 


Solution: 


• a The duration of a criminal trial

• b N(21, 7)

• c 0.3341

• d 22.77


Exercise: 


Problem: 


Terri Vogel, an amateur motorcycle racer, averages 129.71 seconds per
2.5 mile lap (in a 7 lap race) with a standard deviation of 2.28 seconds.
Her lap times are normally distributed. We are
interested in one of her randomly selected laps. (Source: log book of
Terri Vogel)


• a In words, define the random variable X. X = ________
• b X ~ ________
• c Find the percent of her laps that are completed in less than 130
seconds.
• d The fastest 3% of her laps are under ________.
• e The middle 80% of her laps are from ________ seconds to
________ seconds.


Exercise: 


Problem: 


Thuy Dau, Ngoc Bui, Sam Su, and Lan Voung conducted a survey as 
to how long customers at Lucky claimed to wait in the checkout line 
until their turn. Let X = time in line. Below are the ordered real data
(in minutes):


0.50   4.25   5      6      7.25
1.75   4.25   5.25   6      7.25
2      4.25   5.25   6.25   7.25
2.25   4.25   5.5    6.25   7.75
2.25   4.5    5.5    6.5    8
2.5    4.75   5.5    6.5    8.25
2.75   4.75   5.75   6.5    9.5
3.25   4.75   5.75   6.75   9.5
3.75   5      6      6.75   9.75
3.75   5      6      6.75   10.75


a Calculate the sample mean and the sample standard deviation. 
b Construct a histogram. Start the x-axis at −0.375 and make
bar widths of 2 minutes.

c Draw a smooth curve through the midpoints of the tops of the 
bars. 

d In words, describe the shape of your histogram and smooth 
curve. 

e Let the sample mean approximate pz and the sample standard 
deviation approximate o. The distribution of X can then be 
approximated by X~ 

f Use the distribution in (e) to calculate the probability that a 
person will wait fewer than 6.1 minutes. 

g Determine the cumulative relative frequency for waiting less 
than 6.1 minutes. 

h Why aren’t the answers to (f) and (g) exactly the same? 

i Why are the answers to (f) and (g) as close as they are? 


j If only 10 customers were surveyed instead of 50, do you think
the answers to (f) and (g) would have been closer together or
farther apart? Explain your conclusion.


Solution: 


• a The sample mean is 5.51 and the sample standard deviation is
2.15

• e N(5.51, 2.15)

• f 0.6081

• g 0.64
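The stated answers can be checked with a short script. This is a sketch under one assumption: the list `waits` is taken to be the 50 ordered waiting times (in minutes) read column by column from the data table, in a reading that is consistent with the stated sample mean and standard deviation. The model in part (f) uses the rounded parameters N(5.51, 2.15), as the solution does.

```python
from statistics import NormalDist, mean, stdev

# Assumed reading of the 50 ordered waiting times, in minutes.
waits = [
    0.50, 1.75, 2, 2.25, 2.25, 2.5, 2.75, 3.25, 3.75, 3.75,
    4.25, 4.25, 4.25, 4.25, 4.5, 4.75, 4.75, 4.75, 5, 5,
    5, 5.25, 5.25, 5.5, 5.5, 5.5, 5.75, 5.75, 6, 6,
    6, 6, 6.25, 6.25, 6.5, 6.5, 6.5, 6.75, 6.75, 6.75,
    7.25, 7.25, 7.25, 7.75, 8, 8.25, 9.5, 9.5, 9.75, 10.75,
]

# Part (a): sample mean and sample standard deviation.
sample_mean = mean(waits)
sample_sd = stdev(waits)

# Parts (e) and (f): normal model with the rounded sample statistics.
model = NormalDist(round(sample_mean, 2), round(sample_sd, 2))
p_model = model.cdf(6.1)

# Part (g): cumulative relative frequency, counted straight from the data.
crf = sum(1 for w in waits if w < 6.1) / len(waits)

print(round(sample_mean, 2), round(sample_sd, 2), round(p_model, 4), crf)
```

The model probability and the counted relative frequency come out close but not equal, which is the point of parts (h) and (i).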


Exercise: 


Problem: 


Suppose that Ricardo and Anita attend different colleges. Ricardo’s 
GPA is the same as the average GPA at his school. Anita’s GPA is 0.70 
standard deviations above her school average. In complete sentences, 
explain why each of the following statements may be false. 


• a Ricardo’s actual GPA is lower than Anita’s actual GPA.
• b Ricardo is not passing since his z-score is zero.
• c Anita is in the 70th percentile of students at her college.


Exercise: 


Problem: 


Below is a sample of the maximum capacity (maximum number of 
spectators) of sports stadiums. The table does not include horse racing 
or motor racing stadiums. (Source: 
http://en.wikipedia.org/wiki/List_of_stadiums_by_capacity) 


40,000   40,000   45,050   45,500   46,249   48,134
49,133   50,071   50,096   50,466   50,832   51,100
51,500   51,900   52,000   52,132   52,200   52,530
52,692   53,864   54,000   55,000   55,000   55,000
55,000   55,000   55,000   55,082   57,000   58,008
59,680   60,000   60,000   60,492   60,580   62,380
62,872   64,035   65,000   65,050   65,647   66,000
66,161   67,428   68,349   68,976   69,372   70,107
70,585   71,594   72,000   72,922   73,379   74,500
75,025   76,212   78,000   80,000   80,000   82,300


• a Calculate the sample mean and the sample standard deviation
for the maximum capacity of sports stadiums (the data).

• b Construct a histogram of the data.

• c Draw a smooth curve through the midpoints of the tops of the
bars of the histogram.

• d In words, describe the shape of your histogram and smooth
curve.

• e Let the sample mean approximate μ and the sample standard
deviation approximate σ. The distribution of X can then be
approximated by X ~ ________

• f Use the distribution in (e) to calculate the probability that the
maximum capacity of sports stadiums is less than 67,000
spectators.

• g Determine the cumulative relative frequency that the maximum
capacity of sports stadiums is less than 67,000 spectators. Hint:
Order the data and count the sports stadiums that have a
maximum capacity less than 67,000. Divide by the total number
of sports stadiums in the sample.

• h Why aren’t the answers to (f) and (g) exactly the same?


Solution: 


• a The sample mean is 60,136.4 and the sample standard deviation
is 10,468.1.

• e N(60136.4, 10468.1)

• f 0.7440

• g 0.7167


Try These Multiple Choice Questions 


The questions below refer to the following: The patient recovery time 
from a particular surgical procedure is normally distributed with a mean of 
5.3 days and a standard deviation of 2.1 days. 

Exercise: 


Problem: What is the median recovery time? 


• A 2.7
• B 5.3
• C 7.4
• D 2.1


Solution: 


B 
Exercise: 


Problem: 


What is the z-score for a patient who takes 10 days to recover? 


• A 1.5
• B 0.2
• C 2.2
• D 7.3


Solution: 


C 
Exercise: 


Problem: 
What is the probability of spending more than 2 days in recovery? 


• A 0.0580
• B 0.8447
• C 0.0553
• D 0.9420


Solution: 


D 


Exercise: 


Problem: The 90th percentile for recovery times is? 


• A 8.89
• B 7.07
• C 7.99
• D 4.32


Solution:


C


The questions below refer to the following: The length of time to find a 
parking space at 9 A.M. follows a normal distribution with a mean of 5 
minutes and a standard deviation of 2 minutes. 

Exercise: 


Problem: 


Based upon the above information and numerically justified, would 
you be surprised if it took less than 1 minute to find a parking space? 


• A Yes
• B No
• C Unable to determine


Solution: 


A 
Exercise: 


Problem: 


Find the probability that it takes at least 8 minutes to find a parking 
space. 


• A 0.0001
• B 0.9270
• C 0.1862
• D 0.0668


Solution: 


D 


Exercise: 


Problem: 


Seventy percent of the time, it takes more than how many minutes to 
find a parking space? 


• A 1.24
• B 2.41
• C 3.95
• D 6.05


Solution:


C
Exercise: 
Problem: 


If the mean is significantly greater than the standard deviation, which 
of the following statements is true? 


• I The data cannot follow the uniform distribution.
• II The data cannot follow the exponential distribution.
• III The data cannot follow the normal distribution.


• A I only

• B II only

• C III only

• D I, II, and III


Solution: 


B 


8.7 Normal Distribution: Review TCC 


The next two questions refer to: X ~ U(3, 13) 
Exercise: 


Problem: Explain which of the following are false and which are true. 


• a $f(x) = \frac{1}{10}$, 3 ≤ x ≤ 13
• b There is no mode.
• c The median is less than the mean.
• d P(X > 10) = P(X < 6)


Solution: 


• a True

• b True

• c False - the median and the mean are the same for this
symmetric distribution

• d True


Exercise: 


Problem: Calculate: 


• a Mean
• b Median
• c 65th percentile.


Solution:


• a 8
• b 8
• c $P(X < k) = 0.65 = (k - 3)\left(\frac{1}{10}\right)$. k = 9.5
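The answers for X ~ U(3, 13) follow from the fact that the area to the left of x is (x − 3) times the constant height 1/10. The sketch below spells that out; the helper names `uniform_cdf` and `uniform_percentile` are made up for illustration.

```python
def uniform_cdf(x, a, b):
    """P(X < x) for X ~ U(a, b): base (x - a) times height 1 / (b - a)."""
    if x <= a:
        return 0.0
    if x >= b:
        return 1.0
    return (x - a) / (b - a)


def uniform_percentile(p, a, b):
    """Value k with P(X < k) = p, found by solving (k - a) / (b - a) = p."""
    return a + p * (b - a)


a, b = 3, 13
mean_value = (a + b) / 2                       # midpoint of the interval
median_value = uniform_percentile(0.50, a, b)
k65 = uniform_percentile(0.65, a, b)

# Symmetry check from the true/false exercise: P(X > 10) versus P(X < 6).
p_above_10 = 1 - uniform_cdf(10, a, b)
p_below_6 = uniform_cdf(6, a, b)

print(mean_value, median_value, k65, p_above_10, p_below_6)
```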


Exercise: 


Problem: Which of the following is true for the above box plot? 


• a 25% of the data are at most 5.

• b There is about the same amount of data from 4 to 5 as there is
from 5 to 7.

• c There are no data values of 3.

• d 50% of the data are 4.


Solution:


• a False - 3/4 of the data are at most 5
• b True - each quartile has 25% of the data
• c False - that is unknown
• d False - 50% of the data are 4 or less


Exercise: 


Problem: 
If P(G | H) = P(G), then which of the following is correct?


• A G and H are mutually exclusive events.

• B P(G) = P(H)

• C Knowing that H has occurred will affect the chance that G will
happen.

• D G and H are independent events.


Solution: 


D 


Exercise: 


Problem: 


If P(J) = 0.3, P(K) = 0.6, and J and K are independent events, 
then explain which are correct and which are incorrect. 


• A P(J and K) = 0
• B P(J or K) = 0.9
• C P(J or K) = 0.72
• D P(J) ≠ P(J | K)


Solution: 


• A False - J and K are independent, so they are not mutually
exclusive, which would imply dependency (meaning P(J and K) is
not 0).

• B False - see answer C.

• C True - P(J or K) = P(J) + P(K) - P(J and K) = P(J) + P(K) -
P(J)P(K) = 0.3 + 0.6 - (0.3)(0.6) = 0.72. Note that P(J and K) =
P(J)P(K) because J and K are independent.

• D False - J and K are independent, so P(J) = P(J | K).


